103 



208/007 



SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Thomas Ciossek, Axel Ullrich, Birgit 

Millauer 



(ii) TITLE OF INVENTION: 



(iii) NUMBER OF SEQUENCES: 

(iv) CORRESPONDENCE ADDRESS 

(A) ADDRESSEE: 

(B) STREET: 

(C) CITY: 

(D) STATE: 

(E) COUNTRY: 

(F) ZIP: 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 

(B) COMPUTER: 

(C) OPERATING SYSTEM: 

(D) SOFTWARE: ' 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: to be assigned 

(B) FILING DATE: January 3, 1995 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

Prior applications total, 
including application 

described below: none 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(viii) ATTORNEY/ AGENT INFORMATION: 



METHODS FOR DIAGNOSIS AND TREATMENT 
OF MDKl SIGNAL TRANSDUCTION DISORDERS 



Lyon & Lyon 
633 West Fifth Street 
Los Angeles 
California 
USA 
90071 



3.5" Diskette, 1.44 Mb 
IBM compatible 
IBM P.O. DOS (Version 5.0) 
WordPerfect (Version 5-1) 



(A) NAME: 

(B) REGISTRATION NUMBER: 

(C) REFERENCE/DOCKET NUMBER: 



Warburg, Richard J. 

32,327 

208/007 



104 208/007 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE 

(B) TELEFAX: 

(C) TELEX: 

(1) INFORMATION FOR SEQUENCE ID NO : 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 04 

(B) TYPE: nucleic acid 

(C) STRANDNESS: single 

(D) TOPOLOGY: linear 

u& (ii) MOLECULE TYPE: nucleic 

h 

g (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



i^GCGGCCGG TCTGCAGTCG 


GAGACTTGCA 


GGCAGCAAAC 


ACGGTGCGAA 


50 


m 

CfgAACCGGAG GGGGGAGAGA 


GAAATCAAAC 


AGCTAAGCGT 


GGAGCAGACG 


100 


(|qCTGGGACC 


CAGAAGGGGA 


TCGATGCGAG 


GAGCGCAATA 


ATAACAACAA 


150 


TAATAACCCA 


CTTCGGAGCA 


AACAGCATCT 


AAAGAGCTGC 


GACCCAACTG 


200 


dJlGCCTAAAA 


AAATCAAACC 


TGCTCATGCA 


CCATGGTTGT 


TCAAACTCGG 


250 


oflyCCCTTCGT 


GGATTATTTT 


GTGTTACATC 


TGGCTGCTTG 


GCTTTGCACA 


300 


CACGGGGGAG 


GCGCAGGCTG 


CGAAGGAAGT 


ACTATTACTG 


GACTCGAAAG 


350 


CACAACAAAC 


AGAATTGGAA 


TGGATTTCCT 


CTCCACCCAG 


TGGGTGGGAA 


400 


GAAATTAGTG 


GTTTGGATGA 


GAACTACACT 


CCGATAAGAA 


CATACCAGGT 


450 


GTGCCAGGTC 


ATGGAGCCCA 


ACCAGAACAA 


CTGGCTGCGG 


ACTAACTGGA 


500 


TTTCTAAAGG 


CAACGCACAA 


AGGATTTTTG 


TAGAATTGAA 


ATTCACCTTG 


550 


AGGGATTGTA 


ATAGTCTTCC 


CGGAGTCCTG 


GGAACTTGCA 


AGGAAACGTT 


600 


TAATTTGTAC 


TATTATGAAA 


CAGACTACGA 


CACCGGCAGG 


AATATACGAG 


650 


AAAACCTTTA 


TGTTAAAATA 


GACACCATTG 


CTGCAGATGA 


AAGTTTCACA 


700 


CAAGGTGACC 


TTGGTGAAAG 


AAAGATGAAG 


CTGAACACTG 


AGGTGAGAGA 


750 


GATTGGACCT 


TTGTCCAAAA 


AGGGATTCTA 


TCTTGCCTTT 


CAGGATGTAG 


800 


GGGCTTGCAT 


AGCATTGGTT 


TCTGTCAAAG 


TGTACTACAA 


GAAGTGCTGG 


850 
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ACCATTGTTG 


AGAACTTAGC 


TGTCTTTCCA 


GATACAGTGA 


CTGGTTCGGA 


900 


ATTTTCCTCC 


TTAGTCGAGG 


TCCGTGGGAC 


ATGTGTCAGC 


AGTGCCGAGG 


950 


AAGAGGCAGA 


AAATTCCCCC 


AGAATGCATT 


GCAGTGCAGA AGGAGAGTGG 


1000 


CTAGTACCCA 


TTGGAAAATG 


CATCTGCAAA 


GCAGGCTATC 


AGCAAAAAGG 


1050 


GGACACTTGC 


GAACCCTGTG 


GCCGCAGGTT 


CTACAAATCT 


TCCTCTCAGG 


1100 


ATCTCCAGTG 


TTCTCGTTGT 


CCAACCCACA 


GCTTCTCTGA 


CCGAGAAGGA 


1150 


TCATCCAGGT 


GTGAATGTGA 


AGATGGGTAC 


TACAGAGCTC 


CTTCTGATCC 


1200 


ACCATACGTT 


GCATGCACGA 


GGCCTCCCTC 


TGCACCACAG 


AACCTTATTT 


1250 


■JfAATATCAA TCAAACGACT 


GTAAGTTTGG 


AATGGAGTCC 


TCCGGCTGAC 


1300 


S^CGGGGGAA 


GAAACGATGT 


CACCTACAGA 


ATACTGTGTA 


AGCGGTGCAG 


1350 


T^GGGAACAG 


GGAGAATGTG 


TGCCATGCGG 


AAGTAACATT 


GGATACATGC 


1400 


(i^CAGCAGAC 


GGGATTAGAG 


GATAACTATG 


TCACTGTCAT 


GGACCTACTT 


1450 


deCCATGCAA 


ATTACACTTT 


CGAAGTTGAA 


GCTGTAAATG 


GAGTTTCGGA 


1500 


cK-TAAGCAGA 


TCCCAGAGGC 


TCTTCGCTGC 


TGTTAGCATC 


ACCACCGGTC 


1550 


Mgcagctcc 

lli 


CTCGCAAGTG 


AGTGGAGTCA 


TGAAGGAGCG 


AGTACTGCAG 


1600 


dSGAGTGTGC 

|1i 


AGCTTTCCTG 


GCAGGAGCCG 


GAGCATCCCA 


ATGGAGTCAT 


1650 


CACGGAATAT 


GAAATCAAGT 


ATTATGAGAA 


AGATCAACGG 


GAAAGGACGT 


1700 


ACTCAACACT 


CAAAACCAAG 


TCCACCTCCG 


CCTCCATTAA 


TAATCTGAAA 


1750 


CCGGGAACAG 


TGTACGTCTT 


TCAGATCCGG 


GCGGTCACTG 


CTGCCGGTTA 


1800 


TGGAAACTAC 


AGCCCTAGGC 


TTGATGTTGC 


CACACTTGAG 


GAAGCTTCAG 


1850 


GTAAAATGTT 


TGAAGCGACA 


GCAGTCTCCA 


GTGAACAGAA 


TCCTGTCATC 


1900 


ATAATTGCTG 


TAGTGGCTGT 


AGCAGGGACC 


ATCATCTTGG 


TGTTCATGGT 


1950 


GTTCGGCTTC 


ATCATTGGAA 


GAAGGCACTG 


TGGTTATAGG 


AAGGCTGACC 


2000 


AAGAAGGGGA 


TGAAGAACTC 


TACTTTCATT 


TTAAATTTCC 


AGGCACCAAA 


2050 


ACCTACATTG 


ACCCTGAAAC 


CTATGAGGAC 


CCAAATAGAG 


CTGTCCATCA 


2100 


ATTCGCCAAG 


GAGCTAGATG 


CCTCCTGTAT 


TAAAATTGAG 


CGTGTGATTG 


2150 


GTGCAGGAGA 


ATTTGGAGAA 


GTTTGCAGTG 


GTCGTTTGAA 


ACTTCCGGGC 


2200 


CAGAGAGATG 


TTGCAGTGGC 


CATAAAAACC 


CTGAAAGTTG 


GTTACACAGA 


2250 
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AAAGCAAAGG 


AGGGACTTTT 


TATGCGAAGC 


AAGCATCATG 


GGGCAATTTG 


2300 


ACCACCCAAA 


TGTCGTCCAT 


TTGGAAGGGG 


TTGTTACAAG 


AGGGAAGCCT 


2350 


GTCATGATTG 


TGATAGAGTT 


CATGGAGAAT 


GGAGCCCTGG 


ATGCATTTCT 


2400 


CAGGAAACAC 


GATGGGCAGT 


TTACAGTCAT 


TCAGTTGGTA 


GGAATGTTGA 


2450 


GAGGTATTGC 


CGCTGGGATG 


CGATACTTGG 


CTGATATGGG 


ATACGTTCAC 


2500 


AGGGACCTTG 


CAGCGCGCAA 


CATCCTTGTC 


AACAGCAATC 


TTGTTTGTAA 


2550 


AGTGTCAGAT 


TTTGGCCTTT 


CCCGGGTTAT 


AGAGGATGAT 


CCCGAAGCTG 


2600 


TCTACACCAC 


GACTGGTGGA 


AAAATTCCAG 


TAAGGTGGAC 


TGCACCGGAA 


2650 


f^CATTCAAT 


ACCGGAAGTT 


CACCTCAGCC 


AGCGATGTGT 


GGAGCTATGG 


2700 


a 

tATTGTCATG 


TGGGAAGTGA 


TGTCTTATGG 


AGAAAGACCT 


TACTGGGACA 


2750 


■'-'-.1 

Tj^JTCAAATCA AGATGTCATT AAAGCGATAG AAGAAGGTTA TCGTTTGCCG 


2800 


(^GCCCATGG 


ATTGCCCAGC 


TGGTCTTCAC 


CAGCTAATGC 


TGGATTGTTG 


2850 


(fCAGAAAGAT 


CGGGCGGAAA 


GGCCAAAGTT 


TGAGCAGATA 


GTCGGAATTC 


2900 


tAGACAAAAT 


GATTCGAAAC 


CCAAGTAGTC 


TGAAAACACC 


CCTGGGAACT 


2950 


t^TAGTAGAC CCTTAAGCCC TCTTCTGGAC CAGAGCACTC CTGACTTCAC 


3000 


■j^CCTTCTGT 


TCAGTTGGAG 


AATGGTTGCA 


AGCTATTAAA 


ATGGAAAGGT 


3050 


ATAAGGACAA 


CTTCACAGCA 


GCGGGTTACA 


ACTCACTCGA 


GTCAGTGGCC 


3100 


AGGATGACTA 


TCGATGATGT 


GATGAGTTTA 


GGGATCACAC 


TGGTTGGCCA 


3150 


TCAAAAGAAG 


ATCATGAGCA 


GCATCCAGAC 


TATGCGGGCA 


CAAATGTTGC 


3200 


ATTTACACGG 


AACAGGCATC 


CAAGTGTGAC 


ACATCGGCCT 


CCCTCAGATG 


3250 


AGGCTTAAGA 


CTGCAGGAGA 


ACAGTTCTGG 


CCTTCAGTAT 


ACGCATAGAA 


3300 


TGCTGCTAGA 


AGACAGTTGA 


TATACTGGGT 


CCTTCCTACA 


AGAAAGAGAA 


3350 


GATTTTAGAA 


GCACCTCCAG 


ACTTGAACTC 


CTAAGTGCCA 


CCAGAATATA 


3400 


CAAAAAGGGA 


ATTTAGGATC 


CACCACTGGT 


GGCCAGGAAC 


ACAGCAGAGA 


3450 


CAATAAACAA 


AGTACTACCT 


GAAAAACATC 


CCAACACCTT 


GAGCTCTCGA 


3500 


ACCTCCTTTT 


TATCTTATAG 


ACTTTTTAAA 


AATGTACATA 


AAGAATTTAA 


3550 


GAAAGAATAT 


ATTTGTCAAA 


TAAAAATCAT 


GATCTTATTG 


TTAAAATCAA 


3600 


TGAAATATTT 


TCCTTAAAAT 


ATGTGATTTC 


AGACTATTCT 


TTTCCAGAAC 


3650 
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CATCTGTGTT TATTCTGCTT AAGGACTTTG TTTTAGAAAG TTATTTGTAG 3 700 

CTTTGGACCT TTTTAGTGTT AAATTTATGA CACGTTACTA CACTGGGAAC 3750 

CTTTGAAGAC TCTCAAACTT AAAGGAAAGC AAAACTACGC ACATAGTCGA 3 8 00 

GGATGGACTT TGTCCTTCAT GGCTTTGGTA TCCTGGCTGT GTCATTTTGT 3 850 

TAAACCAGTG ATGTTTTCAT ATTGTTTGCT GATTGGCAGG TAGTTCAAAA 3 900 

TTGCAAGTTG CCAAGAGCTC TGATATTTTT TAACAGGATT TTTTTTTCTT 3 950 

TGTAAAAATC AGATAACATA CTAACTTTTC AATGAAAAAA AAAAAAAAAG 4000 

AAGCAATAAT GATCCATAAA TACTATAAGG CACTTTTAAC AGATTGTTTA 405 0 

i;:|i.GAGTGATT TACTAGGCAG AATTTAATAA AAAAAAAAGA GAGATGTCAA 4100 

^TTTAGGTT TATGTGTATA TGATAAAAGG CTGAGCTTCG TCTGAAGATG 4150 

(p^'GGTGAAAG CAAGACTGGA AGCGAAGCTC TCCAGCTTTG GCTAACCCAA 42 0 0 

■IpJlCGAGCACA TCAAGAGCTT CAGTCTTGTG ACAGTAAGAA ATTTAGGAAC 4250 

M'AGTTGACC TATATTTTGT ATTCTTTCTT GTTGAATGCA GTCCAAATAC 43 0 0 

^|aA 4304 

ft) INFORMATION FOR SEQUENCE ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 998 

(B) TYPE: amino acid 

(C) STRANDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2: 



Met Val Val Gin Thr Arg Phe Pro Ser Trp lie lie Leu Cys Tyr lie 
15 10 15 

Trp Leu Leu Gly Phe Ala His Thr Gly Glu Ala Gin Ala Ala Lys Glu 
20 25 30 

Val Leu Leu Leu Asp Ser Lys Ala Gin Gin Thr Glu Leu Glu Trp lie 
35 40 45 



Ser Ser Pro Pro Ser Gly Tirp Glu Glu He Ser Gly Leu Asp Glu Asn 
50 55 60 

Tyr Thr Pro He Arg Thr Tyr Gin Val Cys Gin Val Met Glu Pro Asn 
65 70 75 80 

Gin Asn Asn Trp Leu Arg Thr Asn Trp He Ser Lys Gly Asn Ala Gin 
85 90 95 

Arg He Phe Val Glu Leu Lys Phe Thr Leu Arg Asp Cys Asn Ser Leu 

100 105 110 

Pro Gly Val Leu Gly Thr Cys Lys Glu Thr Phe Asn Leu Tyr Tyr Tyr 
115 120 125 

||u Thr Asp Tyr Asp Thr Gly Arg Asn He Arg Glu Asn Leu Tyr Val 
5 130 135 140 

%1 

ijs He Asp Thr He Ala Ala Asp Glu Ser Phe Thr Gin Gly Asp Leu 
|§5 150 155 160 

(fly Glu Arg Lys Met Lys Leu Asn Thr Glu Val Arg Glu He Gly Pro 
fj 165 170 175 

ilu Ser Lys Lys Gly Phe Tyr Leu Ala Phe Gin Asp Val Gly Ala Cys 

^ 180 185 190 

Me Ala Leu Val Ser Val Lys Val Tyr Tyr Lys Lys Cys Trp Thr He 
195 200 205 

Val Glu Asn Leu Ala Val Phe Pro Asp Thr Val Thr Gly Ser Glu Phe 
210 215 220 

Ser Ser Leu Val Glu Val Arg Gly Thr Cys Val Ser Ser Ala Glu Glu 
225 230 235 240 

Glu Ala Glu Asn Ser Pro Arg Met His Cys Ser Ala Glu Gly Glu Trp 
245 250 255 

Leu Val Pro He Gly Lys Cys He Cys Lys Ala Gly Tyr Gin Gin Lys 
260 265 270 

Gly Asp Thr Cys Glu Pro Cys Gly Arg Arg Phe Tyr Lys Ser Ser Ser 
275 280 285 

Gin Asp Leu Gin Cys Ser Arg Cys Pro Thr His Ser Phe Ser Asp Arg 
290 295 300 

Glu Gly Ser Ser Arg Cys Glu Cys Glu Asp Gly Tyr Tyr Arg Ala Pro 
305 310 315 320 



Ser Asp Pro Pro Tyr Val Ala Cys Thr Arg Pro Pro Ser Ala Pro Gin 
325 330 335 

Asn Leu lie Phe Asn lie Asn Gin Thr Thr Val Ser Leu Glu Trp Ser 
340 345 350 

Pro Pro Ala Asp Asn Gly Gly Arg Asn Asp Val Thr Tyr Arg lie Leu 
355 360 365 

Cys Lys Arg Cys Ser Trp Glu Gin Gly Glu Cys Val Pro Cys Gly Ser 
370 375 380 

Asn lie Gly Tyr Met Pro Gin Gin Thr Gly Leu Glu Asp Asn Tyr Val 

385 390 395 400 

Thr Val Met Asp Leu Leu Ala His Ala Asn Tyr Thr Phe Glu Val Glu 
M. 405 410 415 

A^|a Val Asn Gly Val Ser Asp Leu Ser Arg Ser Gin Arg Leu Phe Ala 

T] 420 425 430 

Ma Val Ser He Thr Thr Gly Gin Ala Ala Pro Ser Gin Val Ser Gly 

Sp^ 435 440 445 

Val Met Lys Glu Arg Val Leu Gin Arg Ser Val Gin Leu Ser Trp Gin 

^ 450 455 460 

Glu Pro Glu His Pro Asn Gly Val He Thr Glu Tyr Glu He Lys Tyr 
^5 470 475 480 

':^r Glu Lys Asp Gin Arg Glu Arg Thr Tyr Ser Thr Leu Lys Thr Lys 
ny 485 490 495 

Ser Thr Ser Ala Ser He Asn Asn Leu Lys Pro Gly Thr Val Tyr Val 

500 505 510 

Phe Gin He Arg Ala Val Thr Ala Ala Gly Tyr Gly Asn Tyr Ser Pro 
515 520 525 

Arg Leu Asp Val Ala Thr Leu Glu Glu Ala Ser Gly Lys Met Phe Glu 
530 535 540 

Ala Thr Ala Val Ser Ser Glu Gin Asn Pro Val He He He Ala Val 

545 550 555 560 

Val Ala Val Ala Gly Thr He He Leu Val Phe Met Val Phe Gly Phe 
565 570 575 

He He Gly Arg Arg His Cys Gly Tyr Ser Lys Ala Asp Gin Glu Gly 
580 585 590 

Asp Glu Glu Leu Tyr Phe His Phe Lys Phe Pro Gly Thr Lys Thr Tyr 
595 600 605 



He Asp Pro Glu Thr Tyr Glu Asp Pro Asn Arg Ala Val His Gin Phe 
610 615 620 

Ala Lys Glu Leu Asp Ala Ser Cys He Lys He Glu Arg Val He Gly 
625 630 635 640 

Ala Gly Glu Phe Gly Glu Val Cys Ser Gly Arg Leu Lys Leu Pro Gly 
645 650 655 

Gin Arg Asp Val Ala Val Ala He Lys Thr Leu Lys Val Gly Tyr Thr 
660 665 670 

Glu Lys Gin Arg Arg Asp Phe Leu Cys Glu Ala Ser He Met Gly Gin 
675 680 685 

Phe Asp His Pro Asn Val Val His Leu Glu Gly Val Val Thr Arg Gly 

hA 700 

&s Pro Val Met He Val He Glu Phe Met Glu Asn Gly Ala Leu Asp 

tyS 710 715 720 

il-a Phe Leu Arg Lys His Asp Gly Gin Phe Thr Val He Gin Leu Val 

y 725 730 735 

m 

dly Met Leu Arg Gly He Ala Ala Gly Met Arg Tyr Leu Ala Asp Met 
L, 74 0 74 5 750 

ily Tyr Val His Arg Asp Leu Ala Ala Arg Asn He Leu Val Asn Ser 

H 755 760 765 

iCin Leu Val Cys Lys Val Ser Asp Phe Gly Leu Ser Arg Val He Glu 

III 770 775 780 

Asp Asp Pro Glu Ala Val Tyr Thr Thr Thr Gly Gly Lys He Pro Val 

785 790 795 800 

Arg Trp Thr Ala Pro Glu Ala He Gin Tyr Arg Lys Phe Thr Ser Ala 
805 810 815 

Ser Asp Val Trp Ser Tyr Gly He Val Met Trp Glu Val Met Ser Tyr 
820 825 830 

Gly Glu Arg Pro Tyr Trp Asp Met Ser Asn Gin Asp Val He Lys Ala 

835 840 845 

He Glu Glu Gly Tyr Arg Leu Pro Ala Pro Met Asp Cys Pro Ala Gly 
850 855 860 

Leu His Gin Leu Met Leu Asp Cys Trp Gin Lys Asp Arg Ala Glu Arg 
865 870 875 880 

Pro Lys Phe Glu Gin He Val Gly He Leu Asp Lys Met He Arg Asn 
885 890 895 



Pro Ser Ser Leu Lys Thr Pro Leu Gly Thr Cys Ser Arg Pro Leu Ser 
900 905 910 

Pro Leu Leu Asp Gin Ser Thr Pro Asp Phe Thr Ala Phe Cys Ser Val 
915 920 925 

Gly Glu Trp Leu Gin Ala lie Lys Met Glu Arg Tyr Lys Asp Asn Phe 
930 935 940 

Thr Ala Ala Gly Tyr Asn Ser Leu Glu Ser Val Ala Arg Met Thr lie 
945 950 955 960 

Asp Asp Val Met Ser Leu Gly lie Thr Leu Val Gly His Gin Lys Lys 
965 970 975 

lie Met Ser Ser lie Gin Thr Met Arg Ala Gin Met Leu His Leu His 

1=* 980 985 990 

fe.y Thr Gly He Gin Val 

^ 995 



INFORMATION FOR SEQUENCE ID NO : 3 : 
(i) SEQUENCE CHARACTERISTICS: 



i) 

p 

JI (A) LENGTH: 610 

S, (B) TYPE: amino acid 

^ (C) STRANDNESS: single 

y (D) TOPOLOGY: linear 

ill 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



Met Val Val Gin Thr Arg Phe Pro 
1 5 

Trp Leu Leu Gly Phe Ala His Thr 
20 

Val Leu Leu Leu Asp Ser Lys Ala 
35 40 

Ser Ser Pro Pro Ser Gly Trp Glu 
50 55 

Tyr Thr Pro He Arg Thr Tyr Gin 
65 70 



Ser Trp He He Leu Cys Tyr He 
10 15 

Gly Glu Ala Gin Ala Ala Lys Glu 
25 30 

Gin Gin Thr Glu Leu Glu Trp He 
45 

Glu He Ser Gly Leu Asp Glu Asn 
60 

Val Cys Gin Val Met Glu Pro Asn 
75 80 



Gin Asn Asn Trp Leu Arg Thr Asn Trp lie Ser Lys Gly Asn Ala Gin 
85 90 95 

Arg lie Phe Val Glu Leu Lys Phe Thr Leu Arg Asp Cys Asn Ser Leu 
100 105 110 

Pro Gly Val Leu Gly Thr Cys Lys Glu Thr Phe Asn Leu Tyr Tyr Tyr 
115 120 125 

Glu Thr Asp Tyr Asp Thr Gly Arg Asn lie Arg Glu Asn Leu Tyr Val 
130 135 140 

Lys lie Asp Thr lie Ala Ala Asp Glu Ser Phe Thr Gin Gly Asp Leu 
145 150 155 160 

Gly Glu Arg Lys Met Lys Leu Asn Thr Glu Val Arg Glu lie Gly Pro 
m ^"7° I'^s 

^u Ser Lys Lys Gly Phe Tyr Leu Ala Phe Gin Asp Val Gly Ala Cys 

1^ 180 185 190 

5^e Ala Leu Val Ser Val Lys Val Tyr Tyr Lys Lys Cys Trp Thr lie 

195 200 205 

^^1 Glu Asn Leu Ala Val Phe Pro Asp Thr Val Thr Gly Ser Glu Phe 
210 215 220 

ifer Ser Leu Val Glu Val Arg Gly Thr Cys Val Ser Ser Ala Glu Glu 
1^5 230 235 240 

§3.u Ala Glu Asn Ser Pro Arg Met His Cys Ser Ala Glu Gly Glu Trp 
W . 245 250 255 

Leu Val Pro lie Gly Lys Cys lie Cys Lys Ala Gly Tyr Gin Gin Lys 
260 265 270 

Gly Asp Thr Cys Glu Pro Cys Gly Arg Arg Phe Tyr Lys Ser Ser Ser 
275 280 285 

Gin Asp Leu Gin Cys Ser Arg Cys Pro Thr His Ser Phe Ser Asp Arg 
290 295 300 

Glu Gly Ser Ser Arg Cys Glu Cys Glu Asp Gly Tyr Tyr Arg Ala Pro 
305 310 315 320 

Ser Asp Pro Pro Tyr Val Ala Cys Thr Arg Pro Pro Ser Ala Pro Gin 
325 330 335 

Asn Leu lie Phe Asn lie Asn Gin Thr Thr Val Ser Leu Glu Trp Ser 
340 345 350 

Pro Pro Ala Asp Asn Gly Gly Arg Asn Asp Val Thr Tyr Arg lie Leu 
355 360 365 



Cys Lys Arg Cys Ser Trp Glu Gin Gly Glu Cys Val Pro Cys Gly Ser 
370 375 380 

Asn lie Gly Tyr Met Pro Gin Gin Thr Gly Leu Glu Asp Asn Tyr Val 
385 390 395 400 

Thr Val Met Asp Leu Leu Ala His Ala Asn Tyr Thr Phe Glu Val Glu 
405 410 415 

Ala Val Asn Gly Val Ser Asp Leu Ser Arg Ser Gin Arg Leu Phe Ala 
420 425 430 

Ala Val Ser lie Thr Thr Gly Gin Ala Ala Pro Ser Gin Val Ser Gly 
435 440 445 

Val Met Lys Glu Arg Val Leu Gin Arg Ser Val Gin Leu Ser Trp Gin 

y& 450 455 460 

flu Pro Glu His Pro Asn Gly Val lie Thr Glu Tyr Glu lie Lys Tyr 

4J5 470 475 480 

ijyr Glu Lys Asp Gin Arg Glu Arg Thr Tyr Ser Thr Leu Lys Thr Lys 

Sfi 485 490 495 

Skr Thr Ser Ala Ser lie Asn Asn Leu Lys Pro Gly Thr Val Tyr Val 

q 500 505 510 

P^ie Gin lie Arg Ala Val Thr Ala Ala Gly Tyr Gly Asn Tyr Ser Pro 
515 520 525 

Arg Leu Asp Val Ala Thr Leu Glu Glu Ala Ser Gly Lys Met Phe Glu 



530 535 540 



Ala Thr Ala Val Ser Ser Glu Gin Asn Pro Val He He He Ala Val 

545 550 555 560 

Val Ala Val Ala Gly Thr He He Leu Val Phe Met Val Phe Gly Phe 
565 570 575 

He He Gly Arg Arg His Cys Gly Tyr Ser Lys Ala Asp Gin Glu Gly 
580 585 590 

Asp Glu Glu Leu Tyr Phe His Ser Leu Val Thr Asn Glu His Leu Ser 
595 600 605 

Val Leu 
610 



114 20 
(1) INFORMATION FOR SEQUENCE ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2901 

(B) TYPE: nucleic acid 

(C) STRANDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4: 

AAGCGGCCGG TCTGCAGTCG GAGACTTGCA GGCAGCAAAC ACGGTGCGAA 5 0 

gpAACCGGAG GGGGGAGAGA GAAATCAAAC AGCTAAGCGT GGAGCAGACG 100 

^CTGGGACC CAGAAGGGGA TCGATGCGAG GAGCGCAATA ATAACAACAA 150 

IIaTAACCCA CTTCGGAGCA AACAGCATCT AAAGAGCTGC GACCCAACTG 200 

rn 

^GCCTAAAA AAATCAAACC TGCTCATGCA CCATGGTTGT TCAAACTCGG 250 

#CCCTTCGT GGATTATTTT GTGTTACATC TGGCTGCTTG GCTTTGCACA 3 00 

||CGGGGGAG GCGCAGGCTG CGAAGGAAGT ACTATTACTG GACTCGAAAG 3 50 

jl-CAACAAAC AGAATTGGAA TGGATTTCCT CTCCACCCAG TGGGTGGGAA 400 

(SIlAATTAGTG GTTTGGATGA GAACTACACT CCGATAAGAA CATACCAGGT 450 

GTGCCAGGTC ATGGAGCCCA ACCAGAACAA CTGGCTGCGG ACTAACTGGA 500 

TTTCTAAAGG CAACGCACAA AGGATTTTTG TAGAATTGAA ATTCACCTTG 55 0 

AGGGATTGTA ATAGTCTTCC CGGAGTCCTG GGAACTTGCA AGGAAACGTT 600 

TAATTTGTAC TATTATGAAA CAGACTACGA CACCGGCAGG AATATACGAG 650 

AAAACCTTTA TGTTAAAATA GACACCATTG CTGCAGATGA AAGTTTCACA 700 

CAAGGTGACC TTGGTGAAAG AAAGATGAAG CTGAACACTG AGGTGAGAGA 750 

GATTGGACCT TTGTCCAAAA AGGGATTCTA TCTTGCCTTT CAGGATGTAG 8 00 

GGGCTTGCAT AGCATTGGTT TCTGTCAAAG TGTACTACAA GAAGTGCTGG 850 

ACCATTGTTG AGAACTTAGC TGTCTTTCCA GATACAGTGA CTGGTTCGGA 900 

ATTTTCCTCC TTAGTCGAGG TCCGTGGGAC ATGTGTCAGC AGTGCCGAGG 950 

AAGAGGCAGA AAATTCCCCC AGAATGCATT GCAGTGCAGA AGGAGAGTGG 10 0 0 

CTAGTACCCA TTGGAAAATG CATCTGCAAA GCAGGCTATC AGCAAAAAGG 105 0 
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20£ 


GGACACTTGC 


GAACCCTGTG 


GCCGCAGGTT 


CTACAAATCT 


TCCTCTCAGG 


1100 


ATCTCCAGTG 


TTCTCGTTGT 


CCAACCCACA 


GCTTCTCTGA 


CCGAGAAGGA 


1150 


TCATCCAGGT 


GTGAATGTGA 


AGATGGGTAC 


TACAGAGCTC 


CTTCTGATCC 


1200 


ACCATACGTT 


GCATGCACGA 


GGCCTCCCTC 


TGCACCACAG 


AACCTTATTT 


1250 


TCAATATCAA 


TCAAACGACT 


GTAAGTTTGG 


AATGGAGTCC 


TCCGGCTGAC 


1300 


AACGGGGGAA 


GAAACGATGT 


CACCTACAGA 


ATACTGTGTA 


AGCGGTGCAG 


1350 


TTGGGAACAG 


GGAGAATGTG 


TGCCATGCGG 


AAGTAACATT 


GGATACATGC 


1400 


CCCAGCAGAC 


GGGATTAGAG 


GATAACTATG 


TCACTGTCAT 


GGACCTACTT 


1450 


GCCCATGCAA 


ATTACACTTT 


CGAAGTTGAA 


GCTGTAAATG 


GAGTTTCGGA 


1500 


eifTAAGCAGA 


TCCCAGAGGC 


TCTTCGCTGC 


TGTTAGCATC 


ACCACCGGTC 


1550 


iiGCAGCTCC 


CTCGCAAGTG 


AGTGGAGTCA 


TGAAGGAGCG 


AGTACTGCAG 


1600 


i^GAGTGTGC 


AGCTTTCCTG 


GCAGGAGCCG 


GAGCATCCCA 


ATGGAGTCAT 


1650 


dACGGAATAT 


GAAATCAAGT 


ATTATGAGAA 


AGATCAACGG 


GAAAGGACGT 


1700 


aItcaacact 

fli 


CAAAACCAAG 


TCCACCTCCG 


CCTCCATTAA 


TAATCTGAAA 


1750 


ifeSGGGAACAG 


TGTACGTCTT 


TCAGATCCGG 


GCGGTCACTG 


CTGCCGGTTA 


1800 


®GAAACTAC 


AGCCCTAGGC 


TTGATGTTGC 


CACACTTGAG 


GAAGCTTCAG 


1850 


GTAAAATGTT 


TGAAGCGACA 


GCAGTCTCCA 


GTGAACAGAA 


TCCTGTCATC 


1900 


ATAATTGCTG 


TAGTGGCTGT 


AGCAGGGACC 


ATCATCTTGG 


TGTTCATGGT 


1950 


GTTCGGCTTC 


ATCATTGGAA 


GAAGGCACTG 


TGGTTATAGC 


AAGGCTGACC 


2000 


AAGAAGGGGA 


TGAAGAACTC 


TACTTTCATT 


CTTTAGTAAC 


AAATGAGCAC 


2050 


CTGTCAGTTT 


TATAAACCGC 


AACAATAACT 


GTTTAAGACA 


ATCAATTTTG 


2100 


GATAAACAAT 


CAACTACAGC 


AGAATAAATC 


AAGATTTTTA 


AGTCCCATTT 


2150 


TCCTTTATAC 


ATTCTGCTTA 


TTTTGTTGTT 


ATATGTTTAT 


TTTTTAAACT 


2200 


CTGATCTTGA 


TTGAATGTGA 


TACCATAAGC 


ACAGTTAGGC 


TGCAGTGTAA 


2250 


ATATATAAAG 


ACATTGTTCT 


GAGAGCAGTA 


CGATTTCATG 


GAAAGATTGT 


2300 


TTGGTGGCTT 


TGTTAAAATT 


AATAAAGAAT 


TTTTAAGGAT 


ATAGTGTAAT 


2350 


TTTCTTCATT 


GCATTAATAT 


AACCAAATAT 


GCCTACCTAT 


CTTTGTCTTG 


2400 


AACCAAATGA 


ATAGATTTGG 


AATACTTTAT 


TGTAATTGAA 


TTTGATATAA 


2450 
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AGTTGACTGA GCATTTATGT GTTACCTGCA TGCTTCTGGG TGCATTGAAA 2500 

TATTTTAACT TTTAAAATGA TACTATGTTG TTTCAATTTT GACTACCTTT 255 0 

TGTGAGGCAT ACTGGCTACC TCCTCCTATT AGCTAAGATC TTCCAAAGCC 2600 

TTATAATGAA AAGTTTATAT AAACCATTTC TCTTTCAAAT CACTGTCATA 2650 

CTTGGTCACG GATCCCAGGA ATATTGTAAA TTTTCTAATT TACTCTGCAC 2 70 0 

TTTGTATATC CAGCCTCTAT TACCCTCAAG GTGAATATAA AACTATGTCT 2 75 0 

TTTGAATATT TCTCTTTGAT TTTGTGATAG CAGTCCCTCA TATCTTGTAC 2800 

TAATTTTATG TATATGTCAA CAGTGGTTGG TCTTTAAAAA TAAATCAAAG 2 85 0 

^TAAGTAAA AAAAAAAAAA AAAAAAAAAA AAAAATAAAA AAAAAAAAAA 2900 



2901 



f4) INFORMATION FOR SEQUENCE ID NO: 5: 
(i) SEQUENCE CHARACTERISTICS: 

Q (A) LENGTH: 62 6 

(B) TYPE: amino acid 

Ni (C) STRANDNESS: single 

(D) TOPOLOGY: linear 

fJ (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 



Met Val Val Gin Thr Arg Phe Pro Ser Trp He He Leu Cys Tyr He 
15 10 15 

Trp Leu Leu Gly Phe Ala His Thr Gly Glu Ala Gin Ala Ala Lys Glu 
20 25 30 

Val Leu Leu Leu Asp Ser Lys Ala Gin Gin Thr Glu Leu Glu Trp He 
35 40 45 

Ser Ser Pro Pro Ser Gly Trp Glu Glu He Ser Gly Leu Asp Glu Asn 
50 55 60 

Tyr Thr Pro He Arg Thr Tyr Gin Val Cys Gin Val Met Glu Pro Asn 

65 70 75 80 

Gin Asn Asn Trp Leu Arg Thr Asn Trp He Ser Lys Gly Asn Ala Gin 
85 90 95 



Arg He Phe Val Glu Leu Lys Phe Thr Leu Arg Asp Cys Asn Ser Leu 
100 105 110 

Pro Gly Val Leu Gly Thr Cys Lys Glu Thr Phe Asn Leu Tyr Tyr Tyr 
115 120 125 

Glu Thr Asp Tyr Asp Thr Gly Arg Asn He Arg Glu Asn Leu Tyr Val 
130 135 140 

Lys He Asp Thr He Ala Ala Asp Glu Ser Phe Thr Gin Gly Asp Leu 
145 150 155 160 

Gly Glu Arg Lys Met Lys Leu Asn Thr Glu Val Arg Glu He Gly Pro 
165 170 175 

Leu Ser Lys Lys Gly Phe Tyr Leu Ala Phe Gin Asp Val Gly Ala Cys 

U 185 190 

iie Ala Leu Val Ser Val Lys Val Tyr Tyr Lys Lys Cys Trp Thr He 
200 205 

f4l Glu Asn Leu Ala Val Phe Pro Asp Thr Val Thr Gly Ser Glu Phe 
% 210 215 220 

Ser Ser Leu Val Glu Val Arg Gly Thr Cys Val Ser Ser Ala Glu Glu 
225 230 235 240 

m 

©Ju Ala Glu Asn Ser Pro Arg Met His Cys Ser Ala Glu Gly Glu Trp 
hA 245 250 255 

m 

-mu Val Pro He Gly Lys Cys He Cys Lys Ala Gly Tyr Gin Gin Lys 

nj 260 265 270 

Gly Asp Thr Cys Glu Pro Cys Gly Arg Arg Phe Tyr Lys Ser Ser Ser 
275 280 285 

Gin Asp Leu Gin Cys Ser Arg Cys Pro Thr His Ser Phe Ser Asp Arg 

290 295 300 

Glu Gly Ser Ser Arg Cys Glu Cys Glu Asp Gly Tyr Tyr Arg Ala Pro 
305 310 315 320 

Ser Asp Pro Pro Tyr Val Ala Cys Thr Arg Pro Pro Ser Ala Pro Gin 
325 330 335 

Asn Leu He Phe Asn He Asn Gin Thr Thr Val Ser Leu Glu Trp Ser 
340 345 350 

Pro Pro Ala Asp Asn Gly Gly Arg Asn Asp Val Thr Tyr Arg He Leu 
355 360 365 

Cys Lys Arg Cys Ser Trp Glu Gin Gly Glu Cys Val Pro Cys Gly Ser 
370 375 380 



Asn He Gly Tyr Met Pro Gin Gin Thr Gly Leu Glu Asp Asn Tyr Val 
385 390 395 400 

Thr Val Met Asp Leu Leu Ala His Ala Asn Tyr Thr Phe Glu Val Glu 
405 410 415 

Ala Val Asn Gly Val Ser Asp Leu Ser Arg Ser Gin Arg Leu Phe Ala 
420 425 430 

Ala Val Ser He Thr Thr Gly Gin Ala Ala Pro Ser Gin Val Ser Gly 
435 440 445 

Val Met Lys Glu Arg Val Leu Gin Arg Ser Val Gin Leu Ser Trp Gin 
450 455 460 

Glu Pro Glu His Pro Asn Gly Val He Thr Glu Tyr Glu He Lys Tyr 
tl^ 470 475 480 

•Qr Glu Lys Asp Gin Arg Glu Arg Thr Tyr Ser Thr Leu Lys Thr Lys 

ti 485 490 495 

"~i 

Mr Thr Ser Ala Ser He Asn Asn Leu Lys Pro Gly Thr Val Tyr Val 

^ 500 505 510 

pRe Gin He Arg Ala Val Thr Ala Ala Gly Tyr Gly Asn Tyr Ser Pro 

L 515 520 525 

Mg Leu Asp Val Ala Thr Leu Glu Glu Ala Ser Gly Lys Met Phe Glu 
H 530 535 540 

m 

I^a Thr Ala Val Ser Ser Glu Gin Asn Pro Val He He He Ala Val 
S|5 550 555 560 

Val Ala Val Ala Gly Thr He He Leu Val Phe Met Val Phe Gly Phe 
565 570 575 

He He Gly Arg Arg His Cys Gly Tyr Ser Lys Ala Asp Gin Glu Gly 
580 585 590 

Asp Glu Glu Leu Tyr Phe His Ser Leu Tyr Arg Glu Arg Gly Asp Glv 
595 600 605 

Met Glu Lys Thr Gin His Asn Lys Lys Trp Met He Ala Ser Cys Ser 
610 615 620 

Arg Leu 
625 
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(1) INFORMATION FOR SEQUENCE ID NO : 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2323 

(B) TYPE: nucleic acid 

(C) STRANDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



AAGCGGCCGG 


TCTGCAGTCG 


GAGACTTGCA 


GGCAGCAAAC 


ACGGTGCGAA 


50 


Q3AACCGGAG 


GGGGGAGAGA 


GAAATCAAAC 


AGCTAAGCGT 


GGAGCAGACG 


100 


ftcTGGGACC 


CAGAAGGGGA 


TCGATGCGAG 


GAGCGCAATA 


ATAACAACAA 


150 


JKataaccca 


CTTCGGAGCA 


AACAGCATCT 


AAAGAGCTGC 


GACCCAACTG 


200 


.qVGCCTAAAA 


AAATCAAACC 


TGCTCATGCA 


CCATGGTTGT 


TCAAACTCGG 


250 


ftcCCTTCGT 


GGATTATTTT 


GTGTTACATC 


TGGCTGCTTG 


GCTTTGCACA 


300 


il^CGGGGGAG 


GCGCAGGCTG 


CGAAGGAAGT 


ACTATTACTG 


GACTCGAAAG 


350 


bkCAACAAAC 

m 


AGAATTGGAA 


TGGATTTCCT 


CTCCACCCAG 


TGGGTGGGAA 


400 


SkAATTAGTG 


GTTTGGATGA 


GAACTACACT 


CCGATAAGAA 


CATACCAGGT 


450 


GTGCCAGGTC 


ATGGAGCCCA 


ACCAGAACAA 


CTGGCTGCGG 


ACTAACTGGA 


500 


TTTCTAAAGG 


CAACGCACAA 


AGGATTTTTG 


TAGAATTGAA 


ATTCACCTTG 


550 


AGGGATTGTA 


ATAGTCTTCC 


CGGAGTCCTG 


GGAACTTGCA 


AGGAAACGTT 


600 


TAATTTGTAC 


TATTATGAAA 


CAGACTACGA 


CACCGGCAGG 


AATATACGAG 


650 


AAAACCTTTA 


TGTTAAAATA 


GACACCATTG 


CTGCAGATGA 


AAGTTTCACA 


700 


CAAGGTGACC 


TTGGTGAAAG 


AAAGATGAAG 


CTGAACACTG 


AGGTGAGAGA 


750 


GATTGGACCT 


TTGTCCAAAA 


AGGGATTGTA 


TCTTGCCTTT 


CAGGATGTAG 


800 


GGGCTTGCAT 


AGCATTGGTT 


TCTGTCAAAG 


TGTACTACAA 


GAAGTGCTGG 


850 


ACCATTGTTG 


AGAACTTAGC 


TGTCTTTCCA 


GATACAGTGA 


CTGGTTCGGA 


900 


ATTTTCCTCC 


TTAGTCGAGG 


TCCGTGGGAC 


ATGTGTCAGC 


AGTGCCGAGG 


950 


AAGAGGCAGA 


AAATTCCCCC 


AGAATGCATT 


GCAGTGCAGA 


AGGAGAGTGG 


1000 


CTAGTACCCA 


TTGGAAAATG 


CATCTGCAAA 


GCAGGCTATC 


AGCAAAAAGG 


1050 
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GGACACTTGC 


GAACCCTGTG 


GCCGCAGGTT 


CTACAAATCT 


TCCTCTCAGG 


1100 


ATCTCCAGTG 


TTCTCGTTGT 


CCAACCCACA 


GCTTCTCTGA 


CCGAGAAGGA 


1150 


TCATCCAGGT 


GTGAATGTGA 


AGATGGGTAC 


TACAGAGCTC 


CTTCTGATCC 


1200 


ACCATACGTT 


GCATGCACGA 


GGCCTCCCTC 


TGCACCACAG 


AACCTTATTT 


1250 


TCAATATCAA 


TCAAACGACT 


GTAAGTTTGG 


AATGGAGTCC 


TCCGGCTGAC 


1300 


AACGGGGGAA 


GAAACGATGT 


CACCTACAGA 


ATACTGTGTA 


AGCGGTGCAG 


1350 


TTGGGAACAG 


GGAGAATGTG 


TGCCATGCGG 


AAGTAACATT 


GGATACATGC 


1400 


CCCAGCAGAC 


GGGATTAGAG 


GATAACTATG 


TCACTGTCAT 


GGACCTACTT 


1450 


(|gCCATGCAA ATTACACTTT 


CGAAGTTGAA 


GCTGTAAATG 


GAGTTTCGGA 


1500 


^TAAGCAGA 


TCCCAGAGGC 


TCTTCGCTGC 


TGTTAGCATC 


ACCACCGGTC 


1550 


Hgcagctcc 


CTCGCAAGTG 


AGTGGAGTCA 


TGAAGGAGCG 


AGTACTGCAG 


1600 


C^GAGTGTGC 


AGCTTTCCTG 


GCAGGAGCCG 


GAGCATCCCA 


ATGGAGTCAT 


1650 


cficGGAATAT 


GAAATCAAGT 


ATTATGAGAA 


AGATCAACGG 


GAAAGGACGT 


1700 


4^TCAACACT 


CAAAACCAAG 


TCCACCTCCG 


CCTCCATTAA 


TAATCTGAAA 


1750 


cteGGGAACAG 

m 


TGTACGTCTT 


TCAGATCCGG 


GCGGTCACTG 


CTGCCGGTTA 


1800 


TpGAAACTAC 


AGCCCTAGGC 


TTGATGTTGC 


CACACTTGAG 


GAAGCTTCAG 


1850 


GTAAAATGTT 


TGAAGCGACA 


GCAGTCTCCA 


GTGAACAGAA 


TCCTGTCATC 


1900 


ATAATTGCTG 


TAGTGGCTGT 


AGCAGGGACC 


ATCATCTTGG 


TGTTCATGGT 


1950 


GTTCGGCTTC 


ATCATTGGAA 


GAAGGCACTG 


TGGTTATAGC 


AAGGCTGACC 


2000 


AAGAAGGGGA 


TGAAGAACTC 


TACTTTCATT 


CTCTTTACAG 


GGAAAGGGGA 


2050 


GACGGGATGG 


AAAAGACACA 


GCACAATAAG 


AAGTGGATGA 


TTGCATCGTG 


2100 


CTCTCGTTTG 


TAGGTCTCTT 


TTCCTAATCA 


ACACTATGAT 


TTTGAAGTAC 


2150 


GCGTACACGA 


AGCAAACGGG 


AAGAGATAAG 


GAATTAGCAT 


TGTGAACCTG 


2200 


ACTGTAATCC 


TCTCTTCCGG 


AAAGAGATGA 


GATGCTATTG 


CGATGAGAAT 


2250 


GTACAACTTG 


CACCTTGAAA 


TCTTTTTTGA 


TAATTAGTGC 


TCAGGGGAGG 


2300 


GGGGGGGAAG 


TAGAGAAAGC 


AAA 
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(2) INFORMATION FOR SEQ ID NO: 7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 

(B) TYPE: 

(C) STRANDEDNESS : 

(D) TOPOLOGY: 



amino acid 

single 

linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 

Ala Ala Thr Ala Ala Ala 
5 



INFORMATION FOR SEQ ID NO : 8: 

Cj (i) SEQUENCE CHARACTERISTICS: 

S (A) LENGTH: 

If! (B) TYPE: 

(C) STRANDEDNESS: 

(D) TOPOLOGY: 



(ii) MOLECULE TYPE: 

(xi) SEQUENCE DESCRIPTION: 



Ala Ala Thr Ala Ala Ala 

5 



amino acid 

single 

linear 



peptide 
SEQ ID NO: 8 



(2) INFORMATION FOR SEQ ID NO : 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 6 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 

His Arg Asp Leu Ala Ala 
5 



(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 

(B) TYPE: 

(C) STRANDEDNESS : 

(D) TOPOLOGY: 

(ii) MOLECULE TYPE: 
(ix) FEATURE: 

(D) OTHER INFORMATION: 



amino acid 

single 

linear 

peptide 



Xaa in position 2 is valine or 
methionine; Xaa in position 5 is 
phenylalanine or tyrosine. 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Asp Xaa Trp Ser Xaa Gly 
5 



s 
m 



